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Abstract 

We consider a class of stochastic impulse control problems of general stochastic processes 
i.e. not necessarily Markovian. Under fairly general conditions we establish existence of 
an optimal impulse control. We also prove existence of combined optimal stochastic and 
impulse control of a fairly general class of diffusions with random coefficients. Unlike, in 
the Markovian framework, we cannot apply quasi- variational inequalities techniques. We 
rather derive the main results using techniques involving reflected BSDEs and the Snell 
envelope. 

AMS subject Classifications: 60G40; 60H10; 62L15; 93E20; 49N25. 

Keywords: stochastic impulse control; Snell envelope; Stochastic control; backward stochastic 
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1 Introduction 

Finding a stochastic impulse control policy amounts to determining the sequence of random 

dates at which the policy is exercised and the sequence of impulses describing the magnitude of 

the applied policies, which maximizes a given reward function. Given the general applicability 
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of stochastic impulse control models in various fields such as finance, e.g. cash management 
(see Korn (1999) for an excellent survey and the textbook by Jeanblanc et al. (2005) and the 
references therein), and management of renewable resources (see e.g. Alvarez (2004), Alvarez 
and Koskel (2007) and the references therein), it is not surprising that the mathematical 
framework of such problems is well established (see Lepeltier-Marchal (1984), 0ksendal and 
Sulem (2006) and the references therein and the seminal textbook by Bensoussan and Lions 
(1984) on quasi- variational inequalities and impulse control). Indeed, in most cases, the impulse 
control problem is studied relying on quasi- variational inequalities, which is possible only 
through tacitly assuming that the underlying dynamics of the controlled system is Markovian 
and the instantaneous part of the reward function a deterministic function of the value of the 
process at a certain instant. These assumptions are obviously not realistic in most applications, 
such as in certain models in commodities trading. Even if the underlying process is Markov, 
the instantaneous part of the reward function may depend on the whole path of the process 
or is simply random. 

In this study we consider a class of stochastic impulse control problems where the underlying 
dynamics of the controlled system is typically not Markov and where the instantaneous reward 
functional is random, in which case, we cannot rely on the well established quasi-variational 
inequalities technique to solve it. Instead, we solve the problem using techniques involving 
reflected BSDEs and the Snell envelope that seem suit well this general situation. The main 
idea is to express the value-process of the control problem as a Snell envelope and show that 
it solves a reflected BSDE, whose existence and uniqueness are guaranteed provided some 
mild integrability conditions of the involved coefficients. This is done through an appropriate 
approximation scheme of the system of reflected BSDEs that is shown to converge to our 
value process. The underlying approximating sequence is shown to be the value process of 
an impulse control over strategies which have only a bounded number of impulses, for which 
an optimal policy is also shown to exist. Finally, passing to the limit, letting the number 
of impulses become large, we prove existence of an optimal policy of our stochastic impulse 
control problem. 

The paper is organized as follows. In Section 2 we recall the main tools on reflected BSDEs 
and Snell envelope we will use to establish the main results. In Section 3, we formulate the 
considered stochastic impulse control. In Section 4, we consider an appropriate approximation 
scheme of the system of reflected BSDEs that is shown to converge to our value process. In 
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Section 5, we establish existence of an optimal impulse control over strategies with a bounded 
number of impulses, in Section 6, we prove existence of an optimal impulse control over all 
admissible strategies. Moreover, the corresponding value process is the limit of the sequence 
of value processes associated with the optimal impulse control over finite strategies, as their 
number becomes large. Finally, in Section 7, we consider a mixed stochastic control and 
impulse control problem of a fairly large class of diffusion processes that are not necessarily 
Markovian. Using a Benes-type selection theorem, we derive an optimal policy using similar 
tools. 

2 Preliminaries and notation 

Throughout this paper (fi, JF, IP) is a fixed probability space on which is defined a standard 
ti-dimensional Brownian motion B = (\B>t)o<t<T whose natural filtration is {J^ := a{M s , s < 
t})o<t<T ; (Ft)o<t<T is the completed filtration of (J-f)o<t<T with the iP-null sets of J 7 , hence 
(J-'t)o<t<T satisfies the usual conditions, i.e., it is right continuous and complete. Let 

• be the a-algebra on [0, T] x $7 of ^-progressively measurable processes. 

• for any p < 2, TL p,k be the set of ^-measurable processes v = (u*)o<t<T with values in 
R k such that E[/ Q T \v s \ p ds] < oo. 

• S 2 (resp. S 2 ) be the set of ^-measurable and cadlag (abbreviation of right continuous 
and left limited) (resp. continuous) processes Y = (^t)o<t<T such that E[sup 0<t<T \ Y t \ 2 \ < 
oo. 

• Sf (resp. S 2 ^ the set of non-decreasing processes k = (h)o<t<T of S 2 (resp. S 2 ) which 
satisfy fco = 0. 

• for t < T, % the set of ^-stopping times v such that IP — a.s., t < v < T. Finally for 
any stopping time v, T v is the <r-algebra on Q which contains the sets A of J 7 such that 

An{v < t} e T t . □ 

Consider now an 5 2 -process X = (X t )o<t<T- The Snell envelope of X, which we denote 
by N(X) = (N(X) t )o<t< T , is defined as 

IP - a.s. N(X) t = ess sup^E^I^], < t < T. 
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It is the smallest cadlag {Tt, iP)-supermartingale of class [D] (see the appendix for the defini- 
tion) which dominates X, i.e., IP — a.s., N{X) t > X t , for all < t < T. 

For the sequel, we need the following result related to the continuity of the Snell envelope 
with respect to increasing sequences whose proof can be found in Cvitanic and Karatzas (1996) 
or Hamadene and Hdiri (2007). 

Proposition 2.1. Let (U n ) n >i be a sequence of cadlag and uniformly square integrable pro- 
cesses which converges increasingly and pointwisely to a cadlag and uniformly square integrable 
process U, then (N(U n )) n >i converges increasingly and pointwisely to N(U). 

In the Appendix at the end of the paper, we collect further results on the Snell envelope we 
will refer to in the rest of the paper. 

Let us underline that in the Markovian case, the problem under consideration is solved us- 
ing PDEs techniques. However, in our framework, we can no longer apply these techniques. 
Instead, we use backward stochastic differential equations (BSDEs in short) which we will 
introduce with others properties. 

Let X = (X t ) <t<T be a barrier process of S 2 and / : [0,T] x O x R 1+d i-> R a drift 
coefficient such that (f(t, u>, 0, 0))o<t<T & H 2,1 and uniformly Lipschitz in (y,z), i.e. there 
exists a constant C > such that 

\f(t,y,z) - f(t,y',z')\ < C(\y - y'\ + \z - z'\) for any t,y,z,y' and z' . 

Then we have the following 

Theorem 2.1. {Hamadene {2002)). There exists a unique ^-measurable triple of processes 
(Y,Z,K) = {Y t , Z t , K t )o<t<r with values in R 1+d+1 solution of the reflected BSDE associated 
with (/, X), i.e., 

' Y e S 2 , Z e H 2 > d and K € S 2 , 

Y t = X T + f f{s,Y s , Z s )ds + K T -K t - I Z s dM s , < t < T, 
< Jt Jt 

Y t > X t , for all 0<t<T, 

/ T (y t - X t )dK- = 0, and A t Y := Y t - Y t _ = -{X t _ - Y t )+ %_y ( _ <0 ], 
where K c is the continuous part of K. Moreover, Y admits the following representation. 

IP -a.s., Y t = esa 8up TeTt E[J f{s,Y s ,Z s )ds + X T \F t ], t<T. (2.1) 
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In addition, if X is left upper semi- continuous, i.e., it has only positive jumps, then the process 
Y is continuous. 

From (|2.ip we note that (Y t + J ' f(s, Y s , Z s )ds)o< t <T is the Snell envelope of the process 
(f* f(s, Y a , Z s )ds + X t )o< t < T . 

In view of the results in El-Karoui et al. (1995), solutions of BSDEs with one reflecting barrier 
can be compared when we can compare the generators, the terminal values and the barriers. 
This remains true in this framework of discontinuous processes. Indeed, the following result 
holds. 

Proposition 2.2. (Hamadene (2002)) Let f (resp. X) be another map from [0,T] x ft x M. 1+d 
into R (resp. another process of S 2 ) such that: 

(i) there exists a process (Y,Z,K) = (Yt, Zt, Kt)t<T solution of the reflected BSDE associ- 
ated with (f,X) 



(ii) IP - a.s. yt < T, f(t, Y t , Z t ) < f(t, Y t , Z t ) 
(Hi) IP - a.s., for all t < T, X t < X t . 
Then, we have IP — a.s., for all t < T , Yt < Yt- □ 

Now, let us consider a sequence (y n ,z n ,k n ) n >i of processes defined as follows: 
(y n , z n , k n ) G S c 2 x H 2 > d x S 2 c i , 

V n t =Vt + j t /(*, * n s)ds + k%-k?-J z n s M s , t < T, 

y? > X t , for all t < T, and f (y t n - X t )dk£ = 0. 
> Jo 

We now recall the following result by S. Peng (1999) which generalizes a well know property 
of supermartingales which tells that an increasing limit of cadlag supermartingales is a also a 
cadlag supermartingale. 

Proposition 2.3. (Peng (1999, pp. 485)) Assume the sequence (y n ) n >o converges increasingly 
to a process (yi)o<t<T such that E[sup 0<t<T \yt\ 2 ] < oo , then there exist two processes (z, k) € 
H 2 > d x S 2 such that 

yt = VT+ / f(s, y s , z s )ds + k T -k t - / z s M s . 
Jt Jt 

In addition, z is the weak (resp. strong) limit of z n in H 2,d (resp. in 7i p,d , for p < 2) and for 
any stopping time r, the sequence (fc") n >o converges to k T in L p (dP). 



In this result, the assumption E[sup 0<t<T \yt\ ] < oo can be replaced by E[sup n>1 sup t<T ] < 
oo. 

3 Formulation of the impulse control problem 

Let L = (L t )o<t<T be a stochastic process that describes the evolution of a system. We assume 
it ^-measurable, with values in R ; and is such that E[J" T |L s | 2 ds] < oo. An impulse control 
is a sequence of pairs 5 = (r n ,^ n ) n >o in which (T Tt ) n >o is a sequence of J-t-stopping times such 
that < To < 7~i < . . . < T JP-a.s. and (£ n )n>o a sequence of random variables with values 
in a finite subset U of M. 1 such that ^ n is T Tn -measurable. Considering the subset U finite is 
in line with the fact that, in practice, the controller has only access to limited resources which 
allow him to exercise impulses of finite size. 

The sequence 5 = (r n , £ n )n>o is said to be an admissible strategy of the control, and the set of 
admissible strategies will be denoted by A. The controlled process L = (L^)o<t<T is described 
as follows: 
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T° 



f L t , if < t < t , 

~ ~ (3.2) 

L t + £ n , if T n < t < r n+ i, n > 0, 



or, in compact form, 



L s t =L t + J2^[r n <t], 0<t<T. 

n>0 

The associated reward of controlling the system is 

J(S) = E[ f h( S , CO, L S s )ds - ^n)\ n< T]], 
J ° n>0 

where h, represents the instantaneous reward and ip the costs due to the impulses. 

This formulation of impulse control also falls within the class of singular stochastic control 
problems, since the bounded variation part of the process, which controls the dynamic of the 
system, is allowed to be only purely discontinuous- See 0ksendal and Sulem (2006) for further 
details. Finally, note that if for example the process L satisfies 

L t = Lq + / b(s, oj)ds + / a(s,uj)dB s , t<T, 
Jo Jo 

where, (6(s))o<s<T and (<t(s))o< s <t are adapted stochastic processes, the existing theory on 
impulse control cannot be applied to the associated problem, since the processes b and a are 
random. 



6 



We make the following assumptions on h and ip. 
Assumption (A) 

(Al) h : [0, T] x $7 x R l — ► [0, +oo) is uniformly bounded by a constant 7 in all its arguments 
i.e. for any (t, u, x) £ [0, T] x Q x R l , < h(t, u, x) < 7. 

(A2) tp : U — ► [0, +00) is bounded from below, i.e. there exists a constant c > such that 
infpeu ^(P) > c- 

Assumption (A2) is motivated by the following form of proportional and fixed transaction costs 
(see Korn (1999) or Baccarin and Sanfelici (2006) for further examples). 

where <p > 0, (f)(0) = and c is positive constant. 

Definition 3.1. A strategy 5* £ A such that 

J(S*) =supJ(<5) (3.3) 

is called optimal. 

The properties of h and ip make the supremum of the reward function J over the set A coincides 
with the one over the set of finite strategies, V defined as 

V = {5 = (r n ; (3 n )n>o e A; F(r n (w) <T, n > 0) = 0}. 

That is, 

supJ(e>) =supJ((5). 
SeA sev 

Indeed, consider a strategy 5 = (r n ; P n )n>o of A which does not belong to V and let B = {uo £ 
VL; T n (uj) < T, n > 0}. Since S is not finite, P(B) > 0. But, since h is bounded, we have 

AS) = E[f T h( S ,L S s )ds-J2^n)l[r n <T]] 
J ° n>0 

< 7T-E[(^^(/3r i )l[.„<T])lB-(E^^«)) IL ^<r]) 1 ^ 

n>0 n>0 

= — OO, 

whence the desired result. 
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4 An approximation scheme 

For any stopping time v and an ^—measurable random variable £, let (Y®(v,l;), Z^{y, £))o<t<T 
be the solution in S 2 x ri 2,d of the following standard BSDE : 

Y t °(v, = [ T h(s, L s + Ol [a > v] ds - [ T Z° s {u, £)M S , 0<t<T. (4.4) 
Jt Jt 

The solution of this BSDE exists and is unique by the well known Pardoux-Peng's Theorem 

(see Pardoux and Peng (1990)) since the terminal value is null and the function h is bounded. 

Next, for any n > 1, let (Y t n (v, £), K™(v, £), Z™(v, £))o<t<r be the sequence of processes defined 

recursively as solutions of reflected BSDEs in the following way: 

' (Y n (v,0,Z n (v,0,K n (v,0) G S 2 x H 2 > d x S 2 ,, 

rp rp 

Y t n (v,0= f h(s,L s + 0Ms>v]ds + K2(v,Z)-K?(v,0- [ Z?(v,Z)M a , < t < T, 
Jt Jt 

Y t n (v,0 > 0^,0 := maxpeu{-WP) + + P)}, 0<t<T, 

[ T (Y?&Z)-O?(u,t))dK?(v,Z) = 0. 
^ Jo 



(4.5) 

Proposition 4.1. For any n > 0, v G Tq and any T v -measurable r.v. £, the triple 
(Y n (i',£ i ),K n (i i ,£ t ),Z n (i',£)) of |^.5[ ) is well posed. Moreover, it satisfies the following proper- 
ties. 

(i) F-a.s. 0<Y t n (v,0<Y t n+1 (v,0, < t < T. 

(ii) F-a.s. Y t n (v,£) <-y(T -t), < t < T. 

Proof: We prove the result by induction on n. We first begin to show the well-posedness 
of {Y n {u, £), K n (y, £), Z n (v, £)) for any n > 0. As pointed out previously for n = 0, for any 
stopping time ^ and any .^-measurable r.v. £, the pair (y°(i/, £), ,Z (z/, £)) exists and belongs 
to S 2 xH 2 > d . Suppose now for some n > 1, for any stopping time f and any .F^-measurable r.v. 
£, the triplet (y n (z^, 0, i^ n (^, £), ^ n (^, 6) exists and belongs to «S 2 x x H 2 > d . Hence, thanks 
to the finitness of U, (0™ +1 (^, £))o<t<T is a continuous process and satisfies O£ + V>0 < 0. 
In view of Theorem 12. 1\ the triplet (y n+1 (z^, £), K nJrl (u, £), Z n+1 (i/, £)) exists and belongs to 
S 2 x S 2 ri x "H 2 ' d . Thus, for any n > 0, any stopping time z/ and any J>-measurable r.v. £, the 
triplet (Y n (v,t),K n (v,£),Z n (v,£)) exists and belongs to S 2 x S 2 4 x H 2 - d . 

Let us now show (i) and (ii). Once more we will use an induction argument. First writing 
Y®(v, £) as a conditional expectation w.r.t. .Ft and taking into account of < h < 7 we 
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obtain that < Y®(v, £) < j(T — t), for any stopping time v and any Jv-measurable r.v. £. 
Next, as K l (v,£) is an increasing process then using standard comparison result of solutions of 
BSDEs (see e.g. El-Karoui et al. (1995)), we obtain Y°(v, £) < Y l (v, £). Therefore, Properties 
(i) and (ii) hold for n = 0. Suppose now that for some n, for any stopping time v and 
any ^v-measurable r.v. £, (i) and (ii) hold. Then, O n+1 (y, £) < O n+2 (z/, £) and then the 
characterization (|2,ip implies that Y n+l (y, £) < y n+2 (z^, £). On the other hand, since, for any 



C G T v , Y n (u,C) < l{T - t), it holds that O t n+ >,£) = max /3eC7 (-V'(/3) + + £)) < 

ma X/36C/ (-V(/3) + 7(T - *)) < l(T - t), < t < T. 



Yr + Hu,0=^ssup Tert E[j;h(s,L s + 0ds + O^\u,0^[r<T]\^t},t<T. (4.6) 



and this completes the proof of the claim. □ 

In the next proposition we identify the limit process Yt(v, £) := /im ra ^ooY^ n (^, £) (which 
exists according to the last proposition) as a Snell envelope. Note that, as a limit of a non- 
decreasing sequence of continuous processes, Y(u, £) is upper semi-continuous. Moreover, it 
holds that 



Finally, once more thanks to the finitness of U, the sequence of processes (O n (v, £)) n >o con- 
verges to 0(v, £) as n — ► oo, where, Ot(z/, £) := max^jy [— ^>(/3) + Y t (v, £ + /?)]), < t < T. 

Proposition 4.2. (i) Let v and v' be two stopping times such that v < v' and £ an T v - 

measurable random variable, then it holds that ¥ — a.s., Yt(v, £) = Yt(v', £) for all t>v'. 

(ii) For any stopping time v and ^ — measurable random variable £ ; the process Y(u,£) is 
cadlag and satisfies: 



Y t (v,() = ess sup reTt K[ / h(s,L s + l [s > u] ds + l [r<T] O T (u,0\F t ], t < T. (4.8) 



Now, once more by (I2.ip . we have, for any n > 1 



Therefore 



Y t n+1 < ess sup TeTt E[ 7 (r -t) + j(T - r)\T t ] = j(T - t) 



0<Y t (u,£) <j(T-t), for all t<T, and Y T (i/,£)=0. 



(4.7) 




Proof: (i) We proceed by induction on n. We note that the solution of the BSDE 
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is unique. It follows that, for any £ G T v , Y t (is,t;) = Y t °(z/,£) for any t > v' . Assume now 
that the property holds true for some fixed n. Then O t n+ V,£) = Of +1 (i/,£), Vt > zA Once 
more the uniqueness of the solution of yields F t n+ V,£) = Y^+V^), Vi > i/. Hence 
the property holds true for any n > and the desired result is obtained by taking the limit as 
n — > oo. 

(££) The sequence of processes ( (Y" i n (^, £) + J * /i(s, L s + £)c?s)o<t<T ) is of cadlag super- 

V / n>0 

martingales which converges increasingly and pointwisely to the process 



86) and taking into account (14. 7p . the limit is also a cadlag supermartingale. It follows that the 
process Y(i/,£) is also cadlag . Next, the processes O n (v, £), n > 1, are cadlag and converge 
increasingly to 0(^, £). The rest of the proof is a direct consequence of Proposition 12.11 □ 

Remark 4.1. Propositions \4-l\ and \4-%\ are generalizations of Corollaries 7.6 and 7.7 in 

0ksendal and Sulem (2006). 

5 Optimal impulse control over bounded strategies 

In this section we establish existence of an optimal impulse control over the set of strategies 
which have only a bounded number of impulses. Indeed, for fixed n > 0, let A n be the following 
set of bounded strategies: 

A n = {(r m ,£ m ) m >i G V, such that T n = T,P - a.s.}. 

Then, the following result, which is a generalizations of Theorem 7.2 in 0ksendal and Sulem 




Therefore, according to Dellacherie and Meyer (1980, p. 



(2006), holds. 



Proposition 5.1. For n > 1, we have 



Y n (0,0) = sup 5£An J(5). 



(5.9) 



In addition, there exists a strategy <5* G A n which is optimal, i.e. 



J(5* n ) = sup J (5). 



(5.10) 



SeA, 



Proof. Let 6* = (t^ , (3%) k>o be the strategy defined as follows. 



r " = inf{s > 0; O?(0,0) = 17(0,0)} AT, 
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and 



O?™(0,0) :=m^(-m+Y T y(0,^))=m^(-m+y: i fHr^P)) (5.11) 

= -^(^)+y T r 1 (r ",/3 "), 

and, for any k € {1, . . . , n — 1}, 

rl = inf{, > rJL i; PT*(^-l, ^ + • • • + 0k-l) = YrHrLi, + • ■ ■ + /f-i)} A T, 
and QSr^rJLi, )flff + . . . + = " + YX^^PS + ... + (3 n k _ x + 

k k 

Note that in (|5.1ip we have taken into account the fact that Y^" 1 ^, /?)) = Y^r 1 (tq , /?)) • 
This equality is valid since /? is deterministic and thanks to the uniqueness of the solutions of 
BSDEs (US} which define y n " 1 (0,/3) and Y™- 1 ^/ 3 ) for * > T o • Finally, r ™ = T and #J G 17 
arbitrary. The choice of (3 n is not very significant since there are no impulses at T. We will 
show that 5* is an optimal strategy. 

For any k < n, the random variables 0^ are J- T n — measurable. Thanks to (|2.ip and ()4.5p 
we obtain 

Y ™(0, 0) = su P E[ / T h(s, L s )ds + l [r<T] O?(0, 0)]. 
rer Jo 

Moreover, since the process O n (0,0) is continuous and Oj,(0,0) < 0, then the stopping time 
Tq is optimal after 0. Therefore, 

Y "(0,0) = E[/ ° h(s,L s )ds + TL [T n <T] O?»(0,0)]. (5.12) 



Now, since for any n > 1, 

Q?»(0,0) = max^tK-W) + ^(O,/?)} = ^^{-^(^ + ^"Vo >/?)} 

= _ W)+ yn-l (r n^n ) . 

The second equality is valid since for any j3 G U we have Y r V 1 (0, /?) = Y r V 1 (r n , /?). 
Then, it holds that 

Y "(0,0) =E[ f° ^,L s )d S + l [To » <T1 (-M") + ^" 1 W 1 «)]. 

« 

But, once again using (|2.ip and (|4.5p . we have 



(5.13) 



n^W.ft") = ess sup re7;ri E[ f h(s,L s + PE)ds + l^O"- 1 ^,^^]. 
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and t" is an optimal stopping time after Tq. It yields that 

^fVo^o) = nP h( S ,L s + ffi)ds + l [T n <T] 0^\TZ,ffi)\F T 2] 

•J T n 

h(s, L s + I3%)ds + 1 [t »<t](- + r T r 2 (rr, /ft + ft))\T T ^]. 
By combining the last equality and (|5.12p we get 
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y n (0,0) = E[/ h(s,L s )ds+ h{ S ,L s + ^)ds + \ K<T] {-^)) 

JO Jt™ 

+ i [rr<T] (-^)) + u [rr<T] y r v 2 (rf ,f% + ft)], 

since [rf < T] C [r n < T] and 1 [t « <t] Q h(s, L s + (3%)ds = Q h{s, L s + fi})ds. 
Repeating this argument as many times as necessary yields 

y n (0,0) = E[ / ° h(s,L s )ds+ J h(8,L a +ffi + ... + ffi_ 1 )d 8 

Jo l<k<n-l Jr k-i 

+ E {*K<T](-Hl3k))} + ^K^KTlYfnjT^ffi + ...+ 
0<fc<n-l 

But, according to (|4.4p we have 

y T °n (r n "_ 1 ,/3 " + ...+^_ 1 )=lE[/ T /»( a ,L a + /3? + ... + /^_ 1 )tfa|^v« J- 

Jt" , 



Therefore, 

y "(0,0) = E[f T ° h(s,L s )ds+ E P ^,L s +/3 n + ... + /f_ 1 )^+ £ {l [r «<T](-V(/3 fc n ))}] 

, ' l<fc<n T fc-l 0<k<n 

= E[[ T ° h(s,L s )ds + Y, P h(s,L s + ffi + ... + ffi_ i)ds + J2{l [T n <T] (-i,(Pm] 

•*° fc>l ^ r fc-i fe>0 

= ■/(<£)• 

It remains to show that J(5*) > J(5' n ) for any strategy <5 /n of ^4 n . 

Indeed, let J' n = (t'q , • • • , t'^-i, T, T, . . . ; (3'q, . . . , (3'n-i, P'n: P'n, ■ ■ •) be a strategy of A n . 
Since Tq is optimal after 0, we have 

V(0,0) > E[/"° ^( S ,L s )ds + l [r ,n <T] 0^(0,0)]. 

But, 



o 



0^(0,0) = maxi-m+K^ 1 ^®} = wgf{-1>W+r% 1 <foM > -Wo)+Y^\r' n Q , fi'l) 
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3> n \ i \^n-l/tn ain\ 



Therefore, we have 

1^(0,0) > E[/ h( 8 ,L a )ds + l [Tt n <n (-^) + ^\^,^))] 
Jo 

in in 

> E[ f ° h(s, L s )ds + f 1 h(s, L s + (3 ,n )ds + l^^-Wo)) 



o 



+ l [rT <T]i;V 2 (r-,/3-+ /37)]. 
Finally, iterating as many times as necessary we obtain 

in in 

y n (0,0) > E[[ ° h(s,L s )ds+ V /" * /i(s,L s + / 3' l + ... + /3'"„ 1 )ds 

70 l<k<n J -'k-i 

+ ^ {i [r ,. <T] (-^))}] = j(0- 

0<fc<n 

Hence, J(<S*) > J(5' n ), for any <5'™ G A n The proof is now complete. □ 

6 An optimal impulse control result. 

We now give the main result of this paper. 

Theorem 6.1. Under Assumption (A), the strategy 5* = (t*,/3*)„>o defined by 

r * = inf{s > 0; O s (0, 0) = Y s (0, 0)} A T, 
max(-^(/3) +r T .(0,/3)) = -^(/3 *) + *r -(r *,$), 

/or n > 1, 

r* = inf{s > r*_ i; ^(r^,^* + . . . + #U) = C^t^,/^* + . . . + A T, 

and 

max(-c - m + F r * $ + ... + /£_! + /3)) = -c - V(/?o*) + *r- «, /3 * + ... + + /£)■ 

is optimal for the impulse control problem. 
Furthermore, we have 

y (o,o) = j(«p). 

Proof. The proof is performed in three steps. 
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Step 1. Continuity of the value process (Yt(v,£))o<t<T- We note that, by (|4.8p . we have, for 
any < t < T, 

Y t(v,0+ [ h(s,L s + £)l [s > v] ds = ess sup reTt E[ f h(s, L s + £) l[ s > u ]ds + lr r<T] O t (u, C)|^t], 

JO JO 

meaning that the process (Y t (v,£) + J Q * h(s, L s + £)TL[ s > v ]dsJ is the Snell envelope of 

( Jo* ^( s ' + ^[s>u]ds + l[t<T] Ot(^) C)J T • Therefore, using Proposition 17.41 in the ap- 
pendix below, there exist a continuous martingale M(z/, £) and two increasing processes A(u, £) 
and -B (z/, £) belonging to <S| such that So(f, £) = and, for < t < T, 

[ h(s, L s + l [s > w] ds + Y t (i/, = M t (u, - A t (y, - flt(i/, £)• 

JO 

In addition, the process A(i>,£) is optional and continuous, and B{y,£) is predictable and 
purely discontinuous. The continuity of the value process Y(z/, £), will follow once we show 
that, for any stopping time v and ^v-measurable random variable £, B{u,^) = 0. Indeed, 
let us assume that B(v,£) is different to zero. Since the process is non-decreasing and purely 
discontinuous, there exists r € T u such that B T (v, £) — B T ^{y, £) > 0. Thanks to (I7.32p . in the 
appendix, we have Y T -(v, £) = O t _(i/, £). Hence, 

y T _(i/,e) = max(-^/3) +y r _ (*/,£ + /?)) > Y T (^£) > 0.(^,0 =max(-^(/3) +y(r,e + /?))• 

Therefore, since C7 is finite, there exists (3\ EU such that the set 

Ai = {Y T _(i/,0 = -rp(fa) + y r _(i/,£ + ft) and AY T _(r/,£ + ft) < 0} 

satisfies -P(Ai) > 0. But, the same holds for AY T _(z;, £ + ft). Therefore, there exists ft £ ?7 
such that the set 

A 2 = {y T -(f, e + ft) = -Tp(fo) + Y T -(v, £ + ft + ft) and AY r _(i/, £ + ft + ft) < 0} 

satisfies P[A\ n A2] > 0. It follows that, on the set Ai n A2, we have 

Y T -(y,0 = -V<ft)-^(ft) + Y T _(>,£ + ft + ft). 

Making this reasoning as many times as necessary we obtain the existence of ft, . . . , (3 n elements 
of U and a subset A ra of positive probability such that, on A n , we have 

n 

Yt- (y, = ~Y1 + Y T -(y,£ + Pi + --- + ft) < -™ + jT. 



i=l 
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But, this is impossible for n large enough since the process Y(r,£) is non-negative. Therefore, 
the purely discontinuous process B{y,£) has no jumps and then it is null. Thus, the process 
Y(v,£) is continuous. 

Step 2. The strategy 5* = (r*,^*) n > G V and is such that F o (0,0) = J(5*). 
Using Proposition 14.21 we get 

y (0,0) = ess sup reTo E[ I h{s,L s )ds + 1 [t<t] O t (0,0)]. (6.14) 

J o 

Now, since £) is continuous for any i/£T and any Jv-measurable random variable £ and 
Ot(0, 0) < 0, then the stopping time Tq is optimal for the problem (|6.14p . This yields 



'0 







y (0,0) = E[£ h(s,L s )ds + l [r * <T] O T *(0,0)]. 

But, 



O T .(0,0) = max{-^(/?) + F r *(0,/3) = wax{-ij>(J3) + F T *(r *, £)} = -^(/T) + ^(r Q *,/3*) 
where /?* G J>*. Note that the second equality is valid thanks to Proposition ^. 2\ -(i). Therefore, 

F o (0,0) =E[ f° h(s,L s )ds + l [T * <T] ](-^(p*) + Y t *(tZ,P*))]. 







Next, 



Yr 5 (^,(3* ) = E[/ Tl h(s,L s + P* )ds + l [Tr<T] O Tt (^,P* Q )\^ TS ] 

= E[ / h( S ,L s + p* )ds + l [r;<T] (-i;(Pt) + F Tf (T 1 * ) ^ *+/3 1 *))|^]. 
Replacing Y t *(tq,/3q) by its expression in (|6.15p . we obtain 



y (0,0) = E[J h(s,L s )ds + h(s,L s + P* )ds + HKO%<T] 

+ (-^(/3 1 *))l [Tf <T]+ ^K'^ + ^l) 1 !^] 

since [r* < T] C [tq < T] and [tq < T] € y>*. Proceeding in the same way as many times as 
necessary we get 

F (0, 0) = E[ f T ° h(s, L s )ds + ...+ r h(s, L s + (3* + ...+ fa_ x )da + (-*K/?o))%<T] + 
+ (-^*))l [r , <T]+ Y r *(r*,^ + ... + /3*_ 1 + ^ n )l K<T] }. (6. 
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Let us now show that S* £ T>. Assume that P{t* < T; n > 0} > 0. Then we have 

y (0,0) < E[/ r ° |fc( S ,L s )|ds + ... + r \h(s,L s + P* + ... + P*_ 1 )\ds+ sup |y s (r*, $ + ... + /?* 

Jo ./<_, s<T 

+ 1{7*<T; n>0} Y (-^(/S)) 1 ^*^] + l{r*<T; n>0} c ^ HH/S)) 1 ^*^]] 
0<fe<n 0<fc<n 

< 7^+ E[sup|y s «,/3 * + ...+/?*) |]-ncP{r* < T; n > 0}. 

s<T 

The last quantity tends to — oo as n — > oo, then Yq(0,0) = — oo which contradicts the fact 
that y(0,0) G S 2 . Therefore, P{r* < T; n > 0} = i.e. 5* E D. Finally, by taking limit as 
n -> oo in ([fU5D we obtain Y (0,0) = J {5*). 

Step 3. J (5*) > J(<5) /or any strategy 5 E A. Let J = (r n ,/3 n ) n > be a finite strategy. Since 
Tq is optimal after 0, we have 

f-TQ 

y (0,0) > E[/ /i( S ,L s )^ + l [T0<T] O T0 (0,0)] 
Jo 



> E[/_ h(s,L s )ds + l [T0<T] {-^(P ) + Y T0 (r Q ,f3 )}}. 

But, 



o 



O T0 (0,0) = max{-^(/3) + Y TO (0,P)} = max{-^(/3) + ^.(tq,/?)} > ~HPo) + l^fo, AO 



It follows that 



Next, 



[TO 

y (0,0) > E[ / / i ( S) L s )ds + l [ro<T] {-^(/3 ) + y o (ro,/?o)}] 

JO 



y- () (T-o,/3o) = ess sup reTro E[ / /i(s, L s + /3 )ds + l[ T<r ]O T (r , /3o)|^v ] 

> E[ r h(s, L s + 0o)ds + l [T1<T] {-V(/3i) + y n (ri, ft, + Pi)}\F n ]. 



TO 



Therefore, 

y (0,0) > E[/ °h(s,L s )ds+ ( 1 h(s,L s + (3 Q )ds + (-ifj(Po))^[r <T] 

JO J T 

+ (-^(A))i [n <T]) + i^Tiy^n, A) + A)]- 

Now, by following this reasoning as many times as necessary we obtain, 

/TO PTk 
h(s,L s )ds+ Y, / h(a,L a + Po + ...+0 k -.i)d8 

+ E (-m))l[r fc <T]+yr„(T n ,/3o + ... + /?n)] 
0<fc<n 
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and since the strategy S is finite, by taking the limit as n — > oo, we obtain Yq(0, 0) > J (6) since 
\Y Tn {T n ,f3 + ...+(3 n )\ < 7l[r„<T]- As (5 G .4 is arbitrary, then Y (0,0) = J(5*) = sup SeV J{6) = 
sup 8eA J(5). □ 

Corollary 6.1. Under Assumptions (A) and (B) it holds that 



7 Combined stochastic and impulse controls 

In this section we study a mixed stochastic and impulse control problem, where, we allow the 
process L, that describes the evolution of the system and subject to impulses, to also depend 
on a control u from some appropriate set V. Therefore, the dynamics of the system is subject 
to a combination of control and impulses. To begin with, we describe this dynamics. 

Let C be the set of continuous functions w from [0, T] into R d endowed with the uniform norm. 
For t < T, let Gt be the cr-field of C generated by {ir s '■ w i-> w s , s < t}. By G we denote the 
cr-field on [0, T] x C consisting of all the subsets G, which have the property that the section 
of G at time t is in Qt and the section of G at w is Lebesgue measurable (see Elliott (1976) for 
more details on this subject). Finally if w 6 C and a is a deterministic function then w + a is 
the function which with t £ [0, T] associates (w + a) t = w t + a. 

Let us now consider a function from [0, T] x C — > R d which satisfies the following 

Assumption (H). 

(HI) cr is G-measurable and there exists a constant k such that 

(i) for every t G [0, T] and every w and w' in C, \a(t,w) — a(t,w')\ < k\\w — w'\\t where 
IMIt = sn Ps<t \ w s\, t < T; 

(ii) for every t G [0, T], \a(t,0)\ < k, a is invertible and its inverse cr -1 is bounded. 

Let V be a compact metric space and "V the set of ^—measurable processes v = (vt)t<T with 
values in V. Hereafter, V is called the set of admissible controls. 

We consider now the process (L t )o<t<T which is the unique solution for the following stochastic 
differential equation: 




(6.16) 



< 



dL t = a(t,L.)M t , 0<t<T, 
L = x, x G R d , 



V 
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whose existence is guaranteed by Assumption (HI). The process L stands for the state of the 
system when non-controlled. 

Let / and (resp. h) be a measurable and uniformly bounded function from [0, T] x C x V into 
R d (resp. R+) such that 

(H2) / and h are G <g> S(V)-measurable 

(H3) for every t G [0, T], w € C, the function which with u G V associates f(t,w,u) (resp. 
h(t,w,u)) is continuous. 

Now, given a control let iP n be the probability measure on defined by 

^ = exp{/ T a- 1 ^, L.)/( S , L, u s )dB s - I J T \a~\s, L.)f(s, L.,u s )\ 2 ds}. 

Thanks to Girsanov's Theorem (see e.g. Revuz and Yor (1991)), for every uef the process 
M u := (M t — Jg a~ 1 (s,L.)f(s,L.,u s )ds^ is a Brownian motion on (0,,J r ,]P u ), and L is a 

weak solution for the following functional differential equation. 

dL t = f(t,L.,u t )dt + a{t,L.)M%, < t < T, 
L = x. 

Under 2P U , the process L represents the evolution of the system when controlled by (t*t)o<t<T 
but not subject to impulses. Next, for a strategy 5 = (r n ,£ n ) n >i G .4, we denote by (Lf)o< t <T 
the process defined by 

L t = L t + En>l ^ 1 [r„<t] 

= x + /o L -> U s) ds + Jo CT ( S > ^O^" + En>l Cnl[r„<t] • 

Under iP u , the process L* 5 stands for the evolution of the system when controlled by (ut)o<t<T 
and subject to the impulse strategy 5. Note that the control and impulses are interconnected. 
The reward function associated with the pair (5, u) is 

J(S,u) =E u [f T h(s,L 5 ,u s )ds- y>(£n)l [Tn <T]], (7-18) 

J ° »>1 

where, is the expectation with respect to the probability measure TP U . With, £o = and 
ro = 0, we have 

[•T fTn+l 

\ h(s,L s ,u s )ds = h(s,L + £ 1 + ... + £ n ,u s )ds. 

n>0^ T " 
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The objective is to find a pair (5* , u*) such that 

J(5*,u*)= sup J(S,u). 
(s,u)eAxV 

Next let .ff be the Hamiltonian associated with the control problem, i.e., the function which 
with (t, w, z, u) G [0, T] x C x R d x V associates it;, 2, u) = za^ 1 ^, w)f(t, w, u) + h(t, w, u). 
The function H is Lipschitz w.r.t. z uniformly in (t, w, u) and through Benes Selection Lemma 
(cf. Benes (1970), Lemma 1), there exists aG® B(M. d ) — measurable function with values in V 
such that for any (t, w, z) G [0, T] x R d+d , 

H*(t, w, z) := sup H(t, w, z, u) = H(t, w, z, u*(t, w,z)). (7-19) 
uev 

Moreover, the function H* is Lipschitz in z uniformly w.r.t. (t, w) as a supremum over u G V 
of functions uniformly Lipschitz w.r.t. (t,w,u). 

For any stopping time v G T, and any .^-measurable random variable £, let 
(y n (£, u), Z n (^, v), K n (£, v)) n >o be the sequence of processes defined as follows. 

*tW)= / T ^(s,i.M+e,^°(C^))l[ s >H^- [ T Z° s (t,v)dM s , 0<t<T, (7.20) 
and, for any n > 1, 

(Y>, 0,^0,^(^0) G «5 C 2 x H 2 > d x S 2 . 

Y t n (v,0 = J H*(8,L. + £,Z?(Z,v))l [a > v] d8 + KZ(v,Z)-K?(i,,Z)- J Z?(v,0M e , t<T, 
Y t n (u, > 0?(i/, := max (—00?) + ^"V, £ + /?)), t < T, 

T 



(i?(f,0-o?(".OW.O = o. 



(7.21) 

We can easily see by induction that for any n > 0, the processes y n (£, f), Z n (£, v) and K n (^, v) 
are well defined, since if* is Lipschitz in z and f7 is finite. In addition, the process Y n (£, v) is 
continuous, since max ( g g [/(— ift(/3) + y^~ (i/, £ + /?)) < 0. Next, in view of Proposition 12,21 it 
holds that, for any n > 0, for any f and f , y n (£, i/) < y n+1 (£, ^) since y°(£, i/) < y x (£, i/). 

Now, according to (|7.20p and (|7.2ip . there are controls u n G V such that: 

Y t °(v,0 =E u °[j\(s,L. + Z,u s )l [s > l/] d S \F t }, 0<t<T, (7.22) 
and, for any n > 1, 

y t n (i/,0 =esssup TeTt E« n [^\( S ,L. + e,<)ll[ s > H & + l [T<T] 0™(^ai^], t <T. (7.23) 
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The last inequality is valid since K n (u,^) is non-decreasing and Y™{y, £) > lr T< rjO™(z/, £). 
Therefore, Y^{y,^) is greater than the expression inside the ess sup. On the other hand, there 
is equality when r = inf{s > t, K r s l (v, f ) - K\ l (v, f) > 0} A T. 

Now, by induction, as in the proof of Proposition 14.11 we obtain that, for any n > 0, r 
a stopping time and any ^v-measurable r.v. £;, the process Y n (y, £) satisfies the following 
property: 

0<Y t n (u,0<l(T-t), t<T, 

where, 7 is the constant of boundedness of h. Therefore, using Proposition 12.31 there exists a 
cadlag process (Y t *(v, C))t<T limit of the increasing sequence (Y n (u, £)) n >o as n — » 00. Moreover 
we have 

0<Y t *(i/,0<7(r-t), t<r. 
In the next proposition, we give a characterization of Y*{v,£). 

Proposition 7.1. The process Y* (u, £) is continuous. Moreover, there exist processes Z*(v,£) € 
ft 2 ' d and G S 2 swc/i ttat, /or a// i < T, 

Y t *(v,Z) = F H*(s,L. + Z,Z*(Z,v))l [s > 1/] ds + K^,0-K*(iy,0~ F Z* a {v,£)dM a , 
Jt Jt 

Y t *(v,0 > O t (u,0 :=max(-^/3) +Y t *(u,£ + P)) 

T 



(Y t *(u,0-O t (u,0)dK*(u,0 = 0. 



(7.24) 

Furthermore, for any pair and any stopping time u' > v, we have Y*{v,^) = Y*,(i>,£). 

Proof. Thanks to Proposition 12.31 there exists a process Z*(u, £) G 7Y 2,rf such that, for any 
p S [1,2), the sequence {Z n {y, £)) n >o converges to Z*(u, £) in 7Y p,ci . This convergence holds 
also weakly in 7i 2 > d . Additionally, there exists an increasing process K*(v, f) G 5 2 such that for 
any stopping time r the sequence (K™(y, £)) n >o converges to K*(y,£) in L p (dP). Therefore, 
we have 

lt*(",£) = /^•(s.L. + ^&i'))^]^ 
Y t *(v,0>O t (v,0 := max (-^/3)+Y/>,£ + /?)), < t < T. 

(7.25) 

The last inequality is valid, since U is finite. 
Next, for t < T, let us set 

R t = ess sup reTt E[^ T iT( S) L. + £, Z s *(£, v))l [s > v] ds + 1 [t<t] 0*(u, Ol-^l- (7-26) 
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Using Characterization (|2.ip of (Rt)o<t<T as a solution of a BSDE yields that, in using the 

comparison result (Proposition I2.2| ). for any t < T, Rt > y n (^, £) and then Rt > Y t *(v,^). 

On the other hand, a result by Peng and Xu (2005) implies that (Rt)o<t<T is the smallest 

H*(s,L. + £, 2r)lr s>I ,i-supermartingale which dominates Ot(z/, £) := max (—ip((3) + YTiv, £ + 
1 - J /Set; 

/5)). But, by (|7.25p . the process y*(i/, £) is a H*(s,L. + £, z)l[ s>I/ ]-supermartingale such that 
> O t (v,0 := max(-i>(P) + Y*(v,Z + {3)). Thus, y t *(i/,0 > Rt, for any t < T. Finally, 
since both processes are cadlag , then P-a.s., i? = y*(i/,£). This means that Y*(v,£) is equal 
to the second term in (|7.26p . Now, using the characterization of Theorem 12, 1\ it holds that 
Y*(v,£) and, Z*(v, £) and K*(u, £) satisfy (|7.24p . The continuity of y*(z/, £) is obtained in a 
similar fashion as in Theorem 16,11 since U is finite. 



Now, if v' > v then thanks to uniqueness result we have, for any n > 0, Y™{v,£) = Y$(v,£), 
and then it is enough to take the limit as n — > oo. □ 

In the same way as previously, for any admissible control u £ "V , a stopping time v, an JF^- 
measurable r.u. £ and n > 0, let us consider the sequence of processes defined recursively 
by 

Y t U '°(v, = ^ + £, Z s "' (£, i/),^)!^]^ - jT ^>°(£, v)M s , t < T (7.27) 

and, for any n > 1, 

r (y u ' n (i/,o ) ^ u,n (^0 5 ^ ,n ( i/ »0) e 5 c x ^ 2,d x ^ 

Y?'>, = jf H(s, L. + £, i/), u s )l [s > v] ds+ 

K^ n (u,0 - K^(u,0 - Jf Z? n (v,0M s , (7.28) 

y t w >>,0 >or>.0 :=max(-^) + yr~V,£ + /3)) 

^ and / T (^"^o-or(^e))^r(^e) = o. 

As above, the sequence of processes (Y u,n (v, £)) n >o is increasing and converges to a cadlag pro- 
cess Y u {y, £) which satisfies < y t u (^, £) < 7(T — i), for any t <T. We also have the following 

Proposition 7.2. T/ie process Y w (v,£) is continuous. Furthermore, there exist two processes 
(Z u (v,£),K u (v,£)) e H 2 ' d x S% such that, for all t < T, 



Y t u (u, £) > O t (u, £) := max (-^(/3) + Y t u (u, £ + /?)), 
/3G(7 

J? ~ O t (v,t))dK?(v,0 = o. 



(7.29) 
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Moreover, we have 

F U (0,0) = sup J(u, 5). 
8&A 

Proof. The proof of the two first claims is the same as the one of Proposition 17.11 It remains 
to show the last one. Indeed, since the triple (Y u (v, £), (Z u (v, £), K u (u, £)) satisfies 

< F t >, > O t (u, := max (-?/>(/?) + Y?(y, £ + 0)) and 
tf(Y t u ^O - O t (y,Q)dK?(y,Z) = 0. 

(7.30) 

it follows, as in Theorem 16.11 that Yq(0,0) = sup 5g _4 J(u, 5). □ 

We give now the main result of this section. 
Theorem 7.1. There exist a control u* £ "V and a strategy 5* = (t*, (3*) n >o € A such that 

J(5*,u*) = sup J (5, u). 

(J,«)e^xv 

In addition, 

F o "*(0,0) = J{S*,u*). 

Proof: Let u £ Y, then through the definitions of Y u (y,£) and Y*{y,£) it holds true that 
> Y u (v,£) since, in using the comparison result of Proposition 12.21 and an induction 
argument, we have Y* ,n (v, £) > Y u,n {y, £), for any n > 0. Hence, we have 

^o*(0,0) >Y U (0,0) = su P J(u,J), 
8eA 

and then 

y *(0,0)> sup J(S, u) > supsup J(u, 6). 
(S,u)eAxV uevseA 

Now, let u* and 5* be defined as follows. 

r* = inf{s > 0; O,(0, 0) = Y s *(0, 0)} A T, 

-ipffi) + y T } {rhPl) = waxpeui-W) + Y*. (0, (3)} = O t * (0, 0), 
u?l [t < T? ] = u*(t,L.,Z? (0,0)) 
and, for n > 2, 

r* = m/{ S > 1? + . . . + = O.fa^ft + . . . + AT, 

-^(/3*) + F T * ( r *, ft* + . . . + + ft) = max^-^/3) + ^ 0{ + ... + /?*_, + /?)} 

= T .(r*_ 1 ,/3 1 * + ... + ^_ 1 ) 
and utl^^it) = u*(t,L. +?{ + ... + ^(t*^,/?? + . . . + 

22 



Therefore, 

y *(0,0) = E U *[P h(s,L, u* s )ds + O T *(0, 0)l [Tf<T] ] 
Jo 

and as O r * (0, 0) = -^(ft) + ^ fa*, ft*) then 

y *(o,o) = E^tr^A^^ + i-^D + y^K,^))!^^]. 

Jo 

But, 

F**fa*,ft) =Y*(T?,(3t)+ P h( S ,L. + 0l,u*)ds- P Z* a (Tt,R)dB? 
= E«* [Y* fa , ft ) + L. + ft , «;)ds|^ ] • 

Plugging the last quantity in the previous equality to obtain 

Y *(0,0) = E U *[P h(s,L,u* s )ds+ r^L + ft^^^-^^Dl^^+niK^ft*)!^^]] 

JO Jtj 

[ P h(s, L 5 * , - V(ft)l W <T] + ft)] 

J o 

C"*[^ T2 fc( S ,L tf *X)cfc- ^(ft)l [Tf<T] +O r| (rr,ft)l [r|<T] ], 

since Y^(t*,P*) = y^(r^ft*)l [T * <T] , [r 2 * < T] C fa < T] and finally 
y^(r 1 %ft) = 0; | (r 1 *,ft)l [T|<T] . 

Repeating now this reasoning as many times as necessary to obtain, for all n > 1, 
y *(0,0) =E u *[Ph(s,L 5 \u* s )ds- Wfe)%<T]+0< +1 «,ft + ... + ft)l [T . +i<T] ]. 

■ y ° fc=l,n 

This property implies first that the strategy 5* is finite since Y*(0, 0) is a real constant. On 
the other hand taking the limit as n — > oo to obtain: 

y*(o,o) = J(«*,5*). 

Thus, 



E" 
E u 



y*(0,0) = J(S*,u*) = sup sup J(S,u), 



and the proof is complete. □ 
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Appendix 



Let 9 (resp. n) be the optional (resp. predictable) tribe onflx [0, T], i.e., the tribe generated by 
the cadlag and .^-adapted processes X = (X t )o<t<r (resp. the left continuous and .Fj-adapted 
processes Y = {Yt)t<T )■ 

Definition 7.1. A measurable process U = (JJ)t<T is said to be of class [D] if the set of 
random variables {U t ,t G T} is uniformly integrable. 

Proposition 7.3. Let U = (U) t <T be an optional process which is of class [D] and N = 
(Nt)t<T the Snell envelope of U defined by: 

N t = ess sup TeTt E[U T \F t ], t < T. 

If U is right upper semi- continuous, then the process N is continuous. 

Proposition 7.4. Let {U)t<T be an optional process of class [D] and N its Snell envelope. 
Then 

(i) there exist a martingale M and two increasing, integrable and right continuous processes 

A and B such that, 

N t = M t -A t -Bt, 0<t<T. (7.31) 

The process A is optional and continuous, and B is predictable, i.e., ir— measurable and 
purely discontinuous. This decomposition is unique. In addition for any t <T we have: 

{A t B > 0} C {U t - = N t -} (7.32) 

and 

A t B = (U t ^ - N t _) + l [AtU<0] . (7.33) 

(ii) IfY^S 2 and M is a continuous martingale with respect to T , then the processes A and 

B are also in S 2 . 
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